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pi I Abstract: We present an implementation of the so-called Ckkw-l merging scheme for 

^ ■ combining multi-jet tree- level matrix elements with parton showers. The implementation 

uses the transverse-momentum-ordered shower with interleaved multiple interactions as 
implemented in PYTHIA8. We validate our procedure using e"'"e~ -annihilation into jets 
and vector boson production in hadronic collisions, with special attention to details in the 
algorithm which are formally sub-leading in character, but may have visible effects in some 
observables. 

We find substantial merging scale dependencies induced by the enforced rapidity ordering 
in the default PYTHIA8 shower. If this rapidity ordering is removed the merging scale 
dependence is almost negligible. We then also find that the shower does a surprisingly 
good job of describing the hardness of multi-jet events, as long as the hardest couple of 
jets are given by the matrix elements. 

The effects of using interleaved multiple interactions as compared to more simplistic ways 
of adding underlying-event effects in vector boson production are shown to be negligible 
except in a few sensitive observables. 

To illustrate the generality of our implementation, we also give some example results from 
di-boson production and pure QCD jet production in hadronic collisions. 
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1. Introduction 

Production rates for multi-jet events at the LHC are very large, and the understanding 
of such events is important, not least as most discovery channels for new physics involve 
jets. The main irreducible, and often huge, background for such signals comes from QCD 
processes. To distill a signal one therefore needs to make complicated cuts to decrease the 
QCD background, sometimes by several orders of magnitude. For this it is very important 
that we have a good understanding, not only of the average behaviour of multi-jet processes, 
but also the fluctuations and very rare events coming from standard QCD. 

The state-of-the-art for simulating multi-jet final states with Monte Carlo event gener- 
ators is to use CKKW-based algorithms to combine exact tree-level matrix elements (ME) 
with parton showers (PS) in a consistent way. Here, the matrix elements describe accurately 
the production of several hard, well-separated partons, while the parton shower encodes 
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how these are evolved into partonic jets by accurately modelling the soft and collinear 
partonic emissions, in a way such that standard hadronisation models can be applied to 
produce realistic exclusive hadronic multi-jet final states. 

However, Ckkw merging algorithms mainly focus on the jets produced in the primary 
interaction, and little attention is normally given to jets which may arise from rare, but 
hard fluctuations in the underlying events. If at all, the underlying-event contribution is 
typically added to the merged sample assuming that the additional scatterings are com- 
pletely independent of the primary interaction. This may be a good approximation in most 
cases, but it is clear that there are correlations between the primary interaction and the 
underlying event, which we think are important to investigate carefully. 

The multiple interaction model in PYTHIA8 is arguably the most advanced model for 
the underlying event today. It contains several sources of correlations between the primary 
interaction and the underlying event. In particular, the model for multiple scatterings is 
tightly tied to the parton shower in that additional scatterings are interleaved with the 
parton evolution. 

In this paper we implement the Ckkw-l algorithm for merging parton showers with 
tree-level matrix elements in PYTHIA8, and in doing so we consider possible effects of the 
fact that the PYTHIA8 shower is interleaved with multiple interactions. Although the effects 
turn out to be small, we note that there may be more sources of correlations which are 
currently not taken into account by PYTHIAS, and our scheme is a way to automatically 
take into account any such correlations also in the merging with tree-level matrix elements. 

It should be noted that this is not the first implementation of matrix-element merg- 
ing with the PYTHIA shower. Interfaces exists for the FORTRAN version of PYTHIA 
to the AlpGEN [1] program by employing the MLM matching prescription [2], and to 
MADGRAPH/ MADEVENT [3] using so-called Pseudo-Shower merging [4]. 

The outline of this article is as follows. First in sections § and |3| we briefly recapitulate 
the main features of the Ckkw-l merging scheme and the interleaved showers respectively, 
before we describe the details of our PYTHIAS implementation in section^ Then we present 
results in section |5|, starting with some control plots to validate the implementation before 
we study the effects of multiple interactions and other formally sub-leading features on the 
production of vector bosons with additional jets at the LHC. We end with showing some 
comparisons with data, and some preliminary results also from di-boson and pure QCD jet 
production. Finally we present our conclusions in section ^. 

2. The Ckkw-l merging scheme 

Here we will present the main features of the Ckkw-l merging procedure. For a more 
detailed discussion of Ckkw-l and other similar merging algorithms we refer to [5,6] and 
the original publications [7,S]. 

The starting point for Ckkw-l is that we have a tree-level matrix-element generator 
capable of generating the Born-level process of interest, as well as the same process with 
up to additional partons. The matrix elements used are regularised with a jet cutoff 
which we refer to as the merging scale, tMs- To the states generated in this way we want to 
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add a parton shower to dress up the hard partons with emissions below the merging scale 
in a way such that the soft and collinear emissions arc properly modelled. 

As the matrix elements are inclusive, in that they give the cross section for states with 
at least n additional partons resolved above the merging scale, it is obvious that we cannot 
simply add the event samples generated with different parton multiplicities. Instead we 
want to make the samples exclusive by reweighting them with Sudakov form factors taken 
to be the no-emission probabilities the parton shower would have used to produce the same 
partonic states. 

To calculate the form factor we first have to reconstruct a parton-shower history for 
the states with n additional partons, S+n, given by the matrix element generator. This 
means that we have to answer the question, how would m,y parton shower have generated 
this state? The answer to this question is not necessarily unique. The parton shower may 
produce a given final parton state in several ways, just as a given state may be represented 
by many different Feynman diagrams. In Ckkw-l, these different path are considered by 
reconstructing all possible parton shower histories, and picking one of them according to 
probabilities calculated from the relevant splitting functions. 

Doing this, we arrive at a history, in which a sequence of parton shower emissions are 
specified by the ordering scale of each emission pi and other splitting variables such as 
the energy fractions, and azimuthal angles, denoted by Zj. We also obtain a sequence of 
intermediate parton states, S'+j. The requirement on the parton shower is therefore that 
it must have complete on-shell intermediate parton states between each splitting. Until 
fairly recently this was only true for the ARIADNE program [9] , which also was the first to 
use the Ckkw-l merging [8]. 

Let us denote Sudakov form factors by 



This is the probability that there are no parton shower emissions from the state iS+j between 
the scales pi, and pj+i . The reweighting with Sudakov form factors now proceeds by starting 
the parton shower at a given intermediate state S'+j, setting pi as the maximum scale, and 
generating one emission {p,z). The probability that this emission is above pj+i is exactly 
1 — As_^_i{pi, Pi+i), so throwing away the event if the emission is above p^+i is equivalent 
to reweighting with the Sudakov form factor. 

A special treatment is called for in the Sudakov between the last emission scale, p„, 
and the merging scale, in the case the cutoff in the matrix elements is not defined in terms 
of the parton shower ordering variable. In the case of n < N, the event is rejected if the 
trial emission from the state is above the matrix clement cut-off, irrespective of how 
it is defined. In the case of n = A^, however, no Sudakov-reweighting is done. 

The n-parton state is typically generated using matrix elements with a fixed as{p), so 
that we also reweight the event with 
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to obtain the same running of as as in the shower. 

FinaUy, note that for initial-state parton-shower sphttings, the no-emission probabil- 
ity His not the same as the Sudakov form factor needed to reweight the matrix-element 
generated state. Instead we have [10,11], 



and the corresponding ratios of parton density functions are included as an additional 
weight. 

We have thus constructed exclusive final states with an arbitrary number of partons 
resolved above the parton shower cutoff scale pc- The distribution of these states are 
resummed to all orders in as, according the precision of the parton shower. However, the 
n < N emissions which are considered hardest in the parton-shower sense, and are above 
the merging scale as well, will have their splitting functions corrected to reproduce the 
correct tree- level matrix element. 

It should be noted that if the merging scale is defined in the same way as the parton 
shower evolution scale, the Ckkw-l is equivalent to standard Ckkw, as long as the latter 
is used with a shower which is properly vetoed and truncated [12]. In Appendix ^ we 
elaborate on how the logarithmic accuracy of the shower is preserved in Ckkw-l and 
compare with the case of standard Ckkw using truncated showers. 

3. Interleaved showers 

As mentioned in the previous section, the requirement on a parton shower to be used 
in the Ckkw-l procedure is that it gives complete on-shell partonic states between each 
emission. In this respect, the transverse-momentum ordered shower in PYTHIA8 [13] is 
perfectly suited. However, it is not completely straight forward to implement Ckkw-l 
with Pythias, as the parton shower in the case of hadron collisions is interleaved with 
multiple interactions. 

The philosophy behind the interleaved shower is that processes with a high scale in 
some sense happen before processes at lower scales. As the emissions in a parton shower 
are not completely independent in that every emission will give rise to recoils and will 
carry away some energy and momentum, it is important that the emissions are performed 
in the right order. It is, for example, not reasonable that an emission of a gluon with 
small transverse momentum removes so much energy as to make an emission with a higher 
transverse momentum impossible. The argument is based on formation times — a final 
state parton with large transverse momentum is to some extend formed long before one 
with small transverse momentum. 

If we consider standard QCD jet production in proton collisions, a parton shower 
is typically initiated by a hard 2 — )• 2 matrix element at some transverse momentum. 
The parton shower then evolves these hard jets by emitting final-state radiation from 
the outgoing partons and initial-state radiation from the incoming partons. This is done 
iteratively, ordering the emissions in transverse momentum. 




(2.3) 
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There is also a chance for a second (semi-)hard interaction between the colhding pro- 
tons. Also, one of the outgoing partons from the hard interaction can rescatter with one 
of the spectator partons in one of the colliding protons, and in addition, outgoing partons 
are allowed to rescatter among themselves. In PYTHIA8 such scatterings are included in 
the shower procedure such that an additional scattering at a scale pui will happen before 
e.g. an initial-state splitting at a scale, pi < pMi- 

This means that the no-emission probabilities are modified in PYTHIA8, and now 
consist of several pieces, 

Us^M^p) = U^sUPi^pMlM^P)'nsUPi,P), (3.1) 

where the superscript refers to the standard parton shower (PS), multiple interactions (MI) 
and rescattering (RS). If we have resolved a state S+i at a scale pi, the probability for a 
change of type a at scale p is 

V%p) = P%p) X Afjp„p)Afl{p,,p)Afjp„p), (3.2) 

where is the inclusive probability. 

Pythias uses an interleaved treatment of spacelike (initial-state radiation — ISR) and 
timelike showers (final-state radiation — FSR), so that the no-emission probability H^^. 
is further subdivided as 

u^slM^p) = nf_J(ft,p)n|SR(p,,p). (3.3) 

The ordering scale, p, is defined in different ways for different processes, but they all 
correspond to a relative transverse momentum of emitted partons. For ISR the scale is 

piSR = {l-z) Q\ (3.4) 

where —Q^ is the virtuality of the incoming original parton and z is its momentum fraction, 
and for FSR we have 

/SR = z{\ -z)Q\ (3.5) 

where is the invariant mass of the radiating parton, and z the energy fraction (in the 
dipole rest frame) of the emitted parton. For MI and RS the scale is simply given by the 
squared transverse momentum of the emitted partons. 

The full interleaving of all shower components makes PYTHIA8 ideal for our prescription 
of matrix element merging, since the full no-emission probability can, as will be explained 
below, easily be generated in only one step. 

4. Implementation in PYTHIA8 

Due to the requirement of fully on-shell intermediate states, Ckkw-l merging has so far 
only been implemented in the ARIADNE shower. Here, we present a new implementation 
within Pythias, which is conceptually equivalent to the former, but differs in details 
relating to the differences in the parton showers. 
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4.1 Constructing the parton shower history 

A key concept of the merging algorithm is the assignment of a shower history — a sequence 
of shower states and evolution scales — to each n-particle configuration supplied by the 
matrix element generator. In the Ckkw-l approach, this is done by constructing every 
possible path from a core Born-level process to the current n-particle state. 

Here we encounter the first difference between ARIADNE and PYTHIA8. The first gluon 
emission is particularly simple in e"'"e~ in ARIADNE, where the evolution variable and the 
splitting kernel for the first splitting are symmetrical between both outgoing legs, thus 
resulting in only one possible path: One dipole splitting into two dipoles. In PYTHIA8, the 
approach is slightly different. Also here a dipole-like approach is used, but the emission is 
explicitly divided up into two contributions stemming from each of the dipole ends, where 
the radiation close to one end of the dipole is considered more likely to come from this 
dipole end itself. Different splitting probabilities for either dipole end will thus result in 
two diff'erent ways in which PYTHIA8 could have arrived at the +l-parton state. In general 
there are more possible paths in PYTHIAS than in ARIADNE. We therefore try to investigate 
in some detail the effects of different ways of choosing a path. 

What we basically want to do is to reconstruct which Feynman diagram gives the 
largest contribution to the state produced by the matrix element generator. The preferred 
option would be to ask the matrix element generator itself, but this information is not 
always easily accessible. Even if such details were available, it is not always enough, as 
a given Feynman diagram may also correspond to diff'erent parton shower histories. As 
is discussed in Appendix ^, we approach this issue by constructing all possible path of 
collinear splittings, and pick a path according to the product of splitting probabilities. 
For the hardest emission, the splitting probability is supplemented so that the matrix 
element transition probability is assured. More precisely, we choose a path according to 
the probability 



Pip : Splitting kernel for splitting number i in path number p, 
Pip : Evolution scale of splitting number i in path number p, 

Zip : Energy fraction carried by the parton emitted in splitting number i in path number 



wip : Improved splitting probability for hardest splitting, including weights of ME correc- 
tions in the shower. 

The precise forms of these terms are derived in Appendix |^, where we also elaborate on 
how the intermediate states 5+j in path p are constructed. 

It must be noted that in the limit of strong ordering, which is the relevant limit when 
looking at the formal logarithmic accuracy of the procedure, picking the most likely path 



Wlpizip) Ui=2 




where 
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is trivial. Hence, the way a path is selected will only give sub-leading effects on any 
observable. We will nevertheless investigate how large these effects are by implementing 
two different schemes. One is similar to the original ARIADNE-implementation, and is based 
on eq. ( [4.11 ). The other is inspired by the CKKW-implementation in HERWIG++ [14], where 
the path which has the smallest sum of transverse momenta in the splittings is chosen 
exclusively. Clearly, in the strongly ordered limit, both of these will find the "right" path, 
but as we will see in section ^ there are visible differences. 

For higher jet multiplicities, minor complications of the path concept arise. First, we 
know that shower emissions are always ordered in some scale variable p (virtuality, angle, 
transverse momentum). This is not always true for consecutive clusterings of jets from a 
matrix element. We choose to interpret a sequence of such unordered splittings as a single 
step in the algorithm such that all steps will be ordered. We must then decide which scale 
to use for this combined emission step. Assume that we have a sequence of reconstructed 
scales given by pi > > P2 > Pi- The combined emission then corresponds to p2 and 
and we can generate the total no-emission probability as 

^s+{po,Pa) = Iis+o{po^ Pi)^s+Api^ P^)^s+z{p-i, Pa) (4.2) 



or 



^S+{PO,Pi) =^S+o{PO,Pl)^S+APl^P2)^S+3{P2,P4) ■ 



(4.3) 



In the former case, the no-emission probability between the scales and p2 is calculated 
using the 1-parton state, while in the latter, the 3-parton state is used. We will investigate 
the difference between using the higher [p^) or lower scale (/02) as minimal scale for rejecting 
trial emissions off the 1-parton state in section 5.1. 

Some rare matrix element config- 
urations, e.g. massive electroweak cor- 
rections to an underlying QCD process, 
as shown in Figure |l], could never have 
been produced in the shower algorithm. 
For such processes, clustering will be at- 
tempted as far as possible. The last, 
irreducible, state will be treated as a 
new hard process, and be assigned a 
shower starting scale in the same way 
Pythias normally would have assigned 
a scale when presented with such pro- 
cesses. When handling externally gen- 
erated processes, PYTHIAS would by de- 
fault start the evolution at the factorisation scale defined in the matrix element evaluation 




Figure 1: An example of a matrix element contri- 
bution without a complete shower history. In this 
case, only the two gluon emission can be reclustered, 
cc — ^ udW" is regarded a separate hard process. 



However, different user choices are allowed. In section |5.l| , we will also investigate the 
effects of other scale choices. 

On a more technical note, we disallow clusterings that will result in a unreasonable 
Born-level process. An example would involve starting from the configuration shown in 
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(b) 

Figure 2: An example of two different ways the iterative clustering may interpret of a particular 
colour configuration in a dg — > W^uuug process. Since (b) has disconnected external particles, no 
valid shower history can be found. 

Figure |2| (a). From only recombining colour and flavour, alternative (b) would be identical, 
and, albeit being disconnected, allowed. The interpretation of the configuration as either 
(a) or (b) is tied to which uu pair is clustered to a gluon. Since we will always be able to 
find sensible paths like (a), impossible paths (b) leading to disconnected diagrams will be 
discarded. 

In other merging prescriptions, these problems are addressed with other strategies. 
In HERWIG++ [14], the authors found that results where insensitive to the treatment of 
unordered or incomplete paths and chose to retain incomplete contributions. SherpA 
[15] follows a different approach in that no incomplete histories are constructed, since if 
necessary, electroweak bosons will be clustered as well. This would interpret the diagram 
in Figure |l| as an electro-weak matrix element correction to di-jet production. 

4.2 Interleaved multiple interactions 

At the LHC, events with only one parton-parton scattering per collision are highly improb- 
able, and a lot of effort has gone in to the modelling of multiple scatterings in PYTHIA8. 
When merging the PYTHIAS shower with matrix elements, it is therefore desirable to keep 
the modelling of multiple scatterings as intact as possible. 

In Pythias, multiple interactions and radiation compete for the available phase space. 
To make sure that some part of phase space is exclusively filled by matrix element configu- 
rations, another part by shower radiation and multiple interactions, we minimally modify 
the Ckkw-l algorithm. The generation of no-emission probabilities has to be slightly 
refined to keep the effect of multiple interactions on the no-emission probability, while 
assuring the validity of our algorithm. 

The formal proof that the merging scale dependence cancels to the accuracy of the 
shower rests on the assumption that the factorisation scheme defined by the shower evolu- 
tion equation is uniform over all of phase space. In Ckkw-l, this is realised by allowing trial 
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emissions in the matrix element domain, i.e. off reclustered states, without phase space re- 
strictions, and vetoing events if the first emission off a ME configuration produced another 
ME configuration, i.e. a parton above tus- In this way, events with non-zero weight have 
been treated identically in the ME and PS regions. This prescription has to be generalised 
to include additional sources of emissions, e.g. multiple interactions. 

The requirement that the shower evolution is identical in ME and PS domains forces 
us to treat multiple interactions on equal footing with radiative emissions, once secondary 
scatterings are included in the evolution of partons by allowing for competition over phase 
space. When performing trial showers on a reclustered state, we thus treat multiple in- 
teractions identical to "ordinary" emissions. The treatment of the first emission off ME 
configurations defines the border between ME and PS regions. We choose to slightly refine 
this definition by requiring that the matrix element region contains only radiative emissions 
above a cut tms- This means that once a different type of emission has been produced, we 
are in the parton shower domain, and we should continue the shower without any additional 
phase space restriction. More concretely, when checking the first shower evolution response 
from a ME configuration, we keep the state if an emission below or a secondary scat- 
tering has been generated. Hence, the lower bound on the matrix-element-corrected region 
is changed to t'^^g = max(tMS) Pmi)- The reason for this treatment is that we want to keep 
hard multiple interactions generated by the shower, rather than unjustifiably restricting 
them to be below ^ms- 

Let us describe our procedure with a specific example for merging up to three additional 
jets. Consider a W -|- 3 gluon event, with scales p\> p2> P3- When only allowing QCD 
radiation and multiple scatterings, this state could be produced by 

1. Three gluon emissions off W production; 

2. One gluon emission off W production, and one secondary gg — > gg or qq — > gg 
scattering; 

Clearly, the first possibility can and should be corrected with matrix elements according to 
the standard Ckkw-l procedure. In the second case, the hardest scale can be attributed 
to either MI (pi = p^i = P2 > Ps) or an emission. In the former, we think of the state 
as inside the PS domain. This means that the shower would have produced the secondary 
interaction first, "freeing" the subsequent emissions from phase space restrictions. Thus, 
we have to generate this state from the 0-jet matrix element, and, to avoid double counting, 
veto it in trial showers off reconstructed configurations. If the hardest scale was associated 
with an emission (pi > p2 = pmi = pz)-, we can distinguish two cases. If the hardest 
emission is in the PS domain already, there is no reason to restrict the event generation 
further by disallowing MI above a certain scale. In effect, the configuration is taken from 
the evolution of the 0-jet ME sample, while removing it from the 1-jet sample by vetoing 
configurations with Pmi > Pi,recius in the trial showers. Finally, the emission with pi can be 
in the matrix element phase space. Adding one secondary interaction will produce a state 
of two correlated 2—7-2 processes. Since no matrix elements can include this state, it is 
unambiguously inside the PS region, even without applying additional constraints related 
to a merging scale. This reasoning leads us to define the cross-over of ME and PS domains 
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by a phase space cut for emissions, or the existence of more than one 2 — )• 2 process. Coming 
back to the example, we will generate this state from the 1-jet matrix element by adding a 
secondary scattering. In order to avoid double counting, in trial showers off reconstructed 
states, we veto the event if the trial emission resulted in pi^rccius > Pmi > /02,rccius- 

This example illustrates the algorithm and sheds light on how particular configurations 
are generated. The bottom line is that every event where the n hardest (according to the 
parton shower ordering) partons can produced in one of the matrix elements samples, it 
will be taken from this sample. Hence, we are still true to the philosophy of Ckkw-l 
merging. Note that in this publication, we will only consider merging matrix elements 
with additional QCD-induced jets. Therefore we will e.g. treat photon radiation in the 
shower in the same way as to multiple interactions. 

To validate our algorithm, we chose to implement an alternative treatment of multi- 
ple interactions, which is similar to the prescription applied in SherpA [15]. For this, we 
exclude multiple interactions when performing trial showers on reclustered states, keeping 
only the shower emissions in the Sudakov form factors. Then, when showering the matrix 
element configurations, we allow additional interactions below the scale pi of the reclus- 
tered 2 — 2 process. For the +0 jet contribution, we choose po = tus as maximal scale. 



Differences between both treatments are investigated in section p.l . 
4.3 The algorithm step-by-step 

After choosing a parton shower history for the matrix element state, the weight the parton 
shower would have generated while evolving to this state has to be calculated. This in- 
cludes the running of in the shower, the no-emission probabilities generated by choosing 
particular splittings and the way parton distribution functions guide the space-like evolu- 
tion. In the Ckkw-l scheme, a seamless inclusion of ME configurations into the parton 
shower is then achieved by reweighting the state with the parton shower weight 

W^CKKWL = ^o/o~(^0-Po) X I TT '^tfH^t'Pi) ^7 fri^i 'Pi) j 

fn {^n ff^jp) fn {^n :fJ'jr) \ ^ ^ ^i—l fi — li^ i—l ^Pi) ^i — lfi — li^i — l:Pi) I 

\i=l / 




H^S+^.M-UPi)] >^^S+Apn,tMs) (4.4) 



^4 = 1 



■^n fn i-^n ^ Pn) fn i-^n i Pn) 
Xn fit {Xn , p'p) Xn fn {Xn , P^p) 

asjpi) xt_ 1 /+ 1 (x+_ 1 , Pj- 1 ) xr_ ^fr_^{x-_^, pi-i) 
OsME x+ i/+i(x+ i,pi) x:r_jr_^(x~_^, Pi) 



n 



X 

1=1 



^s+Apn,tMs) , (4.5) 



ns+,_i(/5i-l,Pi) 

where pi are the reconstructed scales of the splittings. The first PDF ratio in eq. (^ 
means that the total cross section is given by the lowest order Born-level matrix element, 
which is what the non-merged PYTHIA8 shower uses. The PDF ratio in brackets comes 
from of the fact that shower splitting probabilities are products of splitting kernels and 
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PDF factors. The running of is correctly included by the second bracket. Finally, the 
event is made exclusive by multiplying no-emission probabilities. In our implementation, 



we chose to reorder the PDF ratios according to eq. (4.5), so that only PDFs of fixed flavour 
and x-values are divided, thus making the weight piecewise numerically more stable. The 
algorithm to calculate and apply this weight can be summarised as follows: 

I. Produce Les Houches event files (LHEF) [16] with a matrix element generator for 
n = 0,1 ... N extra jets with a regularisation cut-off, tMS) typically using a fixed 
factorisation scale, fip, and a fixed ctsME- 

II. Pick a jet multiplicity, n, and a state 5„ according to the cross sections given by the 
matrix element generator. 

1. Find all shower histories for the state pick a sequence according to the 
product of splitting probabilities. Only pick un-ordered sequences if no ordered 
sequence was found. Only pick incomplete paths if no complete path was con- 
structed. 



2. Perform reweighting according to eq. (4^): For each ^ i — 1 < n, 



i. Start the shower off the state Si-i at Pi-i, generate a trial state Ri with 
scale pr^. If > Pi, veto the event and start again from II. 

ii. Calculate the weight factor 

asjpi) xtift-ii^t-v Pi-i) xr_jr {xr pi_^) 
Wi-i = T T T (4.5) 

OsME Xi_J^_^{x^_^, Pi) Xi_Ji_^{x^_^, Pi) 

3. Start the shower from Sn at pn, giving a state Rn+i with the scale Pr„^-^^. 

i. If n < N , and Rn+i was produced from Sn by QCD radiation, and 
k±[Rn+i) > ^MS! reject the event and start again from II. Otherwise, accept 
the event and the emission and continue the shower. If a multiple interaction 
was generated, keep it and continue the shower without restrictions. 

ii. If n = A^, continue the shower without vetoing. 
III. If the event was not rejected, multiply the event weight by 

Xnfni-^n^Pn) X^ fn {Xrn Pn) 



X 



(4-7) 



Xn fn {Xn , p'p) Xn fn {Xn , Pp / ^_ 

IV. Start again from II. 

Our merging approach is, with dynamically generated Sudakov factors, tailored to al- 
ways reproduce what PYTHIA8 would most probably have done to arrive at the current 
configuration. Starting scales are of course no exception. Thus, we will start (trial) show- 
ering of electroweak 2—7-2 processes at the kinematical limit -y/s, both for radiation and for 
multiple interactions, which is the default procedure in PYTHIA8. In this way, the question 
for a starting scale of multiple interactions when merging additional emissions is irrelevant. 

For jet production in the pure QCD case, by default we set the transverse momentum 
of the outgoing partons in the 2—7-2 process as starting scale in the shower and multiple 



- 11 - 



interactions. This should be adequate as long as the merging scale is not too small. For 
very small merging scales we have the option of including a Sudakov form factor giving the 
probability that no additional scatterings are produced between the maximum scale, y/s^ 
and the transverse momentum of the 2—7-2 process. This would make the primary process 
exclusive, in the sense that we make sure that there are no harder scatterings in the event. 

Note also that in pure QCD, the Born-level 2 — )• 2 process is in itself divergent and we 
must introduce a cutoff regularisation. This cutoff need not be the same as the merging 
scale. In fact we will here choose a much lower scale to avoid having a large fraction of the 
reclustered multi-jet ME-states ending up below the cut and resulting in un-ordered paths. 
In addition, the procedure must be changed slightly since also the scale of the reclustered 
2 — )• 2 state is included in the classification of un-ordered histories. 

In all cases, we implemented the scale settings such that user choices (e.g. forcing 
"power showers") are always transferred to the trial showers off 2 — )• 2 processes. For 
higher-order tree- level matrix elements, we use the reconstructed splitting scale of the 
state as starting point. 

When comparing alternative MI treatments, special care is required when setting the 
starting scale. For the SHERPA-inspired prescription, we will set the scale pi of the reclus- 
tered 2 — )• 2 process as the MI starting scale for states 5+n>0; and allow multiple scatterings 
below po = *Ms for the -|-0 jet matrix element contributions. 

5. Results 

We have implemented the necessary code for Ckkw-l merging in PYTHIA8, where it has 
been publicly available as of version 8.157. 

In the following, we will first show some validation plots on parton level for jet produc- 
tion in e''~e~ collisions and weak boson production at hadron colliders. We then move to 
more realistic observables for these processes, and compare to data. Thereafter, di-boson 
production and pure QCD jet production are examined. 

As input matrix element kinematics, we choose Les Houches Event Files generated 
with MADGRAPH/MAD Event and the following settings^: 

• Fixed renormalisation scale /i^j = Mz. 

• CTEQ6L1 parton distributions used for hadron collisions. 
as(Mz) = 0.118 for lepton collisions and to Qs(Mz) = 0.129783 for hadron collisions. 



Fixed factorisation scale p^p set to Mw for W-l-jets, Mz for Z-|-jets, Mw + Mz for 
WZ-Fjets and Mz for pure QCD di-jets. 

Durham/fc^-cut 

2 . min |2 • m.m.{Ef , E'j){l — cos %)| for e"'"e~ — >-jets 

min I min {p\ ^ , ^ ) , min {p\ ^ , p^. - ) ^^'''^ pi^'^'' ^ | for pp(pp) {Y+) jets. 



^Note that the values of as and the factorisation scales used here are somewhat irrelevant, as they will 



nevertheless be divided out in eq. (4.5) 
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with D = 0.4, to regularise the QCD divergences and act as merging scale t^s- 

• Require pT^i > 20 GeV in Z+ jets to avoid low momentum in 7 propagators. 

• Require pxj > 10 GeV in QCD di-jet events. 

For brevity, we will refer to results of merging of up to additional jets as MEA^PS. 
Contributions for a fixed number n < N of jets from the matrix element will be indicated by 
a superscript n, as in ME"A^PS. Also, we will write PYTHIAS when talking about the default 
Pythias behaviour. For all distributions, we use routines of the fast jet package [17] to 
define and analyse jets. If not otherwise indicated, we present plots at the parton level, 
i.e. after shower and multiple interaction evolution, since merging effects are more visible 
without smearing due to hadronisation. 

5.1 Validation 

We begin by considering the simplest case, with only one extra parton added to the Born- 
level state. This is a very useful benchmark for any matching or merging algorithm, as 
emphasised in [5], because many parton shower programs, such ARIADNE and PYTHIAS, 
implement directly the tree-level matching by modifying the splitting functions for the first 
emission. Hence, when comparing a merged parton shower with the matched one, it is very 
easy to see if the merging algorithm, for example, has any non-trivial dependence on the 
merging scale. 

Merging scale dependence in e^e~ — t- jjj 

The Pythias parton cascade by default includes reweighting of the first splitting of the 
hard process with the correct matrix element expression, thus giving an excellent handle 
to check our implementation of e'''e~ — )• jets. To compare our result with PYTHIAS, we 
however have to make a minor change to the shower. When supplied with a e^e~ — )• qq 
state, Pythias will use the three body matrix element as splitting kernel for the first 
splitting of q and the first splitting of q. This is done since the e^e~ — )• qqg matrix 
element provides a better estimate of the dipole splitting kernel than the DGLAP kernel. 
However, when starting from e^e~ — )• qqg input, PYTHIAS will use DGLAP kernels in 
the evolution of the quarks. Thus the showers response to LHEF input of e^e~ — )• qq 
and e^e~ — )• qqg will slightly differ when constructing additional jets. Since we want to 
merge also higher jet multiplicities with the PYTHIAS cascade, it is natural to exclude the 
improvement in the e"'"e~ — )• qq case, and switch off the usage of matrix element correction 
weights for more than three final partons. In the most recent versions of PYTHIAS, such a 
switch is available for user input. 

Doing this, we can compare MEIPS with PYTHIAS. The variable used as a separation 
cut between matrix element and parton shower domains is most sensitive to the imple- 
mentation of the merging procedure. In Figure ^ we show the value of k± for which three 
jets would be clustered to two jets. As desired, we find excellent agreement, and, when 
examining different values of the separation cut ^ms) vanishing merging scale dependence. 
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Figure 3: fc^ separation of the tliird jet in e+e^ collisions at ii^cM = 91.25 GeV. Jets were 
defined with the Durham algorithm. Hadronisation was switched off. The bottom in-set shows 
the deviation of the merged samples for three different merging scales ^ms with respect to default, 
matrix-element-corrected P YTHIA8 . 

Merging scale dependence in pp — V + 1 jet 

Similarly, the implementation of V-l-1 jet merging can be validated against default 
Pythias. In accordance with the discussion above, we switch off additional matrix el- 
ement reweighting factors in default PYTHIAS after the first initial state emission. Further, 
it is important to note that in PYTHIAS, infrared divergences in space- like splittings are 
regularised by shifting the denominator of the integration measure in the evolution equation 
by a small Preg- This shift is inspired by the interleaved evolution of space- like splittings 
and multiple interactions, where colour screening will dampen the number of interactions. 
Not strictly perturbative effects like these will be present in the default PYTHIAS distri- 
butions, even at ~ 0(10 GeV). That the merging is well under control is shown in 
Figure |^, where we set p^eg = for the first splitting in default PYTHIAS to remove the 
deliberate mismatch in integration measures. We then find complete agreement in the k± 
distributions. 

Influence of the prescription on how to choose a shower history 

That different prescriptions to choose amongst reconstructed histories differ only by sub- 
leading terms is exemplified in Figure |5[ We see a small merging scale dependence when 
always choosing the history with the smallest sum of transverse momenta. The smallness 
of the effect stems from the fact the probabilistic choice — on average giving the "correct" 
shower history — is dominated by a - factor, so that picking a history by lowest scale p 
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Figure 4: Transverse momentum of the hardest jet in W + 1 jet events at Ecu = 7000 GeV in 
pp coUisions. Jets were defined with the /c^-algorithm with D = 0.4. Multiple interactions and 
hadronisation have been switched ofi^. The bottom in-set shows the deviation of the merged sample 
with respect to default matrix-element corrected PYTHIA8, for three different merging scales. 

or probabilistically almost equally well answer the question "how would my parton shower 
have generated this state". 

Variation when changing the starting scales for un-ordered histories 

In the following, we refrain from setting the infrared regularisation parameter preg to 
zero. When facing histories with unordered emission sequences, different ways to assign 
an emission scale to the combined splitting are conceivable, as discussed in section 
To investigate this we turn to two-jet merging, the lowest non-trivial jet multiplicity at 
which non-ordered histories may occur. Figure ^ highlights that when choosing the lower 
scale as a common scale, the transverse momentum of the second jet has a harder tail 
compared to setting the higher of both scales as the scale of the combined emission. Also, 
back-to-back jets are more prominent. This is an effect of the reweighting with a running 
coupling constant, which produces a more pronounced enhancement of the cross section 
when choosing smaller scales. For all further results, we will use the larger scale when 
evaluating as{p). 

Variation due the choice of starting scales for incomplete histories 

Figure shows the consequence of adopting different shower starting scales for incomplete 
histories. Particularly the consistency of distributions for pQ = p^p = (80.4 GeV)^ and 
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Figure 5: A comparison of different prescriptions of choosing the history for e''"e — 3 jets. Results 



for choosing in a probabihstic way, with sphtting probabihties defined in eqs. B.IO and B.ll 



labelled "ckkw-1" , while adopting a winner-takes-it-all strategy of picking the history with lowest 
scale carries the label "scale". The plots were produced with a merging scale tMs = mm{k±i} = 
10 GeV. Hadronisation was switched off. The left panel shows the fc^-separation (in the Durham 
algorithm) between the third and second hardest parton in the first (reconstructed) emission. The 
distributions for ME°1PS and ME^IPS for a scale-dependent choice are shown in the upper part, 
whereas the bottom in-set gives the deviation of both prescriptions from default PYTHIA8. In the 
right panel we show the A;_L-separation the third and second hardest jets defined in the exclusive 
Durham algorithm for the probabilistic approach, with the bottom in-set again giving the deviation 
of both prescriptions from default PYTHIA8. 



po = s = (1960 GeV)^ allows to conclude that the dependence on the starting scale for 
incomplete emissions is negligible, which reflects the fact that the corresponding states are 
very rare. 



Differences between treatments of multiple interactions 

Different treatments of multiple interactions are presented in Figure which illustrates 
that at the LHC, variations of up to 10% may occur between the default Ckkw-l recipe 
and the SHERPA-inspired alternative. Due to the phase space restriction pQ = t^s for 
additional scatterings in Z -|- jet matrix element samples, the alternative treatment 
produces fewer multiple interactions. Thus, the k±i spectrum for intermediate scales 
15 GeV < k±i < 30 GeV is softer than the Ckkw-l result. At scales k±i > 50 GeV, 
the two prescriptions become indistinguishable. The behaviour at low scales is also antic- 
ipated, since the alternative sample does not include suppression due to Ml no-emission 
probabilities. Since these are present in default PYTHIA8, the alternative recipe exhibits 
a higher maximum, whereas the default prescription reproduces the showers low scale fea- 
tures closely. 

Our goal when developing a generalisation of the Ckkw-l method including interleaved 
showers was to be as similar for low scales to the event generator as possible, meaning that 
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Figure 6: Two different -ways of choosing a combined scale for unordered emissions, in W + 2 
jet events at Ecu = I960 GeV in pp collisions. The merging scale is tms — 10 GeV. The curves 
are labelled -with ">" when assigning the higher scale ^combined ~ inax(pi, p^+i), and -with "<" 
■when assigning the lo"wer scale Pcombincd = ^^^{pi, Pi+i), as the combined scale of t-wo unordered 
emissions. The bottom in-sets sho"w the deviation of the lo"wer scale sample "with respect to the 
higher scale sample. Jets -were defined in the /cj^-algorithm with D = 0.4, while multiple scatterings 
and hadronisation were turned off. The left panel shows the azimuthal difference A0i2 between the 
hardest and second hardest jet. The right panel shows the k± separation k±2 of the second hardest 
jet. 



the modelling of PYTHIA8 in regions "where multiple interactions are important should be 
left unchanged. As pointed out in 4^, this can be formally be achieved in PYTHIA8 by 
employing the "Ckk"W-l" prescription. The discussion of the last paragraph also sho"wed 
that in the implementation of the method, low scale features of PYTHIA8 are retained. 
Hence, we choose the "Ckk"W-l" prescription of adding the influence of multiple scatterings 
as the default. As can be inferred from Figure |l^ belo"w, this method succeeds in not 
changing the underlying event description of PYTHIAS. 

Because in "weak boson measurements at lo"w scales, the shape and position of maxima is 
unchanged in the Ckkw-l approach, we also minimise the need for changes of some tuning 
parameters, e.g. primordial p±. This is not obviously true for the alternative method, in 
which some changes in primordial p± might be necessary. Meanwhile, once hadronisation 
is added and experimental cuts are applied, Z+ jets observables at the Tevatron show only 
little dependence on the strategy how multiple interactions are included in merged samples. 



Unitarity violations 

We finish our validation by discussing a theoretical issue. Parton shower resummation 
alone does not change the cross section of the hard process, since the probability of having 
no emission, together with the sum of probabilities to evolve into states with an arbitrary 
number of emissions adds to unity — a property dubbed unitarity. This however is only 
true if the transition probabilities used in generating additional emissions are identical to 
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Figure 7: Azimuthal difference A(j>i2 between the liardest and second tiardest jet, for three different 
ways of choosing the starting scale po for incomplete histories, in W + 2 jet events at Ecu = I960 
GeV in pp collisions. The merging scale is tus — mm{k±i} — 10 GeV. The curves are labelled 
for Po = p,"^, "sHat" ii po = s and "s" if po = s. Jets were defined by the fcj^-algorithm with 
D = 0.4. Multiple scatterings and hadronisation were switched off. The bottom in-sets show the 
deviation of the po — s and po = s samples with respect to the p^ sample. 

the terms exponentiated in Sudakov form factors. As pointed out in [18,19], unitarity is 
violated by tree level merging due to the fact that the transition probabilities above and 
below are different, while Sudakov factors are always generated with shower splitting 
kernels, i.e. the transition probabilities below tMs- The magnitude of the resulting unitarity 
violations for different merging scales is assessed for W+ jets in Table |^. We have also 
verified that the main points of the following discussion apply to all example processes used 
in this report. 

First, we note that including one additional jet does not lead to unitarity violations for 
vector boson production, since PYTHIA8 is already matrix element corrected, so that the 
full tree- level splitting probability is exponentiated. When including more than one jet, we 
observe smaller deviations from the hard process cross section as we increase the merging 
scale. This is expected since for larger Ims, the Sudakov form factors generated by trial 
showering quickly approach unity. Because of the higher merging scale, phase space regions 
with low scale emissions (where Sudakov factors differ from unity) are generated by the 
parton shower. Thus, identical splitting probabilities are used to generate the emissions and 
Sudakov form factors, and unitarity is preserved to reasonable accuracy. One immediate 
consequence is that we should not choose tms too low, since otherwise, sizable violations 
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Figure 8: Two examples for differences in the treatment of secondary interactions, for Z + 2 jet 
events at Ecu = 7000 GeV in pp collisions. The merging scale is Ims — inm{k±i} = 10 GeV. Jets 
were defined with the fcj^-algorithm with D — 0.4. The left panel shows the R separation Ai?i2 
between the hardest and second hardest jet, with the bottom in-set giving the difference of the 
SHERPA-inspired sample with respect to the Ckkw-l treatment. The right panel shows the jet k± 
after hadronisation, when clustering to exactly one jet. Ratios of the two MI treatments to default 
Pythias are found in the bottom right in-set. 
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Table 1: Impact of changing the merging scale t-^s and maximum number of jets on the process 
cross sections, for three different processes, c+c^ — > jets is evaluated LEP energy {Eqm — 91.25), 
and cross sections for pp — Z-t-jets and pp — W+-fjets are calculated at Tevatron Run II en- 
ergy {EcM = 1960). Results were produced with PYTHIA8 Tune 4G. Multiple interactions and 
hadronisation were switched off. 



can occur. 

Unitarity violations give a measure of how well the shower splitting probability, inte- 
grated over the PS phase space (ordered in the evolution variable), captures the matrix 
element features and the allowed phase space. Different choices of PS evolution variables 



-19- 



Process 




2^2 


MEIPS 


ME2PS 


ME3PS 


ME4PS 


pp W++jets 


10 GeV 
15 GeV 
30 GeV 
45 GeV 


1034(1) pb 


1037(3) pb 
1034(3) pb 
1034(3) pb 
1034(3) pb 


1048(3) pb 
1043(3) pb 
1038(3) pb 
1036(3) pb 


1047(3) pb 
1044(3) pb 
1038(3) pb 
1036(3) pb 


1045(3) pb 
1043(3) pb 
1038(3) pb 
1036(3) pb 



Table 2: Impact of changing the merging scale t-^s and maximum number of jets on the W+ 
jets cross sections in pp colhsions at Eqm = I960. Multiple interactions and hadronisation were 
switched off. Results were produced using Tune 4C, with enforced rapidity ordering switched off. 
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Figure 9: Comparison of the k± of the first jet in W+ jets events at Ecu — I960 GeV in pp 
collisions, between default, first-order corrected PYTHIA8, and Ckkw-l, for different number of 
merged jets and different merging scales. Jets were defined with the fc_L-algorithm with D = 0.4, 
clustering to exactly one jet. Multiple interactions and hadronisation were switched off. Left panel: 
Results when using Tune 4C, which by default includes ordering emissions in rapidity as well as p. 
Right panel: Results when using Tune 4C, with enforced rapidity ordering switched off. 



can lead to different regions of the full phase space — which includes unordered emissions — 
being neglected in the parton shower approximation. These regions of unordered emission 
sequences are formally beyond the accuracy of the shower. Figure ^ shows the differences 
in transverse momentum distributions between merged distributions and PYTHIA8, for two 
different ways of ordering emissions. Deviations from unitarity are more significant if the 
shower evolution is ordered both in p and rapidity. This is due to neglecting larger regions 
of the full phase space in the parton shower. We have verified that when only keeping ME 
configurations for which a history ordered in p and rapidity can be constructed, unitarity 
violations are greatly reduced. Nonetheless, we have to conclude that ordering the cascade 
in these two variables makes the parton shower approximation worse than ordering in p 
alone. Only ordering in p, we find in Table |2| that the inclusive 2—7-2 cross section is not 
changed drastically when including additional jets. Also, the k±i spectrum becomes only 
slightly harder in this case, as is seen in the right panel of Figure |9[ 

It should be noted that the rapidity ordering was introduced in PYTHIA8 to suppress 
the high transverse momentum emissions from dipoles between incoming and outgoing 
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partons. However, this is now achieved through another damping mechanism described 
in [20], which means that the rapidity ordering is no longer needed to achieve a reasonable 
description of data. 

We checked that deviations from unitarity can be even further reduced when exclud- 
ing unordered emissions. However, in Ckkw-l, we want to include states which are out 



of the reach of the shower, and thus, as discussed in section 4.1, by default keep ME con- 
figurations for which only unordered histories can be found. For enthusiasts, switches for 
rejecting configurations with unordered histories are available in the public code. Provided 
considerable unitarity violations remain after excluding differences between the full allowed 
and the PS phase space, this could suggest large higher-order corrections, since by choosing 
a low merging scale we effectively include major parts of the real emission phase space of 
an NLO calculation [21]. It should be noted that unitarity is a parton shower concept and 
need not be fulfilled in other contexts, see e.g. [22]. 

5.2 e"'"e~ four-jet observables 



Modified Naditmann-Reiter angle (parton level) Angle between the two softest jets (parton level) 




Figure 10: Four jet angular correlations in e+e^ collisions at £'cm — 91.25 GeV, as measured by 
OPAL [23]. Up to four additional jets were included in the merged samples. Effects of hadronisation 
are included. Left panel: Modified Nachtmann-Reiter angle \cos6nji\. Right panel: Angle between 
the two lowest energy jets cos 0:34. The plots were produced with RIVET [24]. 

Merging procedures aim for a better description of well separated jets in the parton 
shower. Historically, angular correlations in e''~e~ — )• 4 jet production have been used to 
investigate the 3-gluon vertex. The description of these observables should be improved 
when including additional jets. More specifically, we look at the the (modified) Nachtmann- 
Reiter angle 

(5.1) 



I cos 9nr\ 



\Pl -P2I \P3 -Pil 
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and the angle between the two lowest energy jets 



cos 0:34 = , (5.2) 

where pi are the energy ordered three- vectors of the jets. As shown in Figure 10, the default 
Pythias description of these observables is fairly good to start with, reflecting the fact 
that some azimuthal correlations are included in the shower, and it is only slightly changed 
when merging additional jets. We notice that |cos0Ari?| becomes slightly worse when in- 
cluding additional jets. The different shape of the generator curves can be explained by 
the fact that the data was corrected to the parton level, whereas the MC samples where 
generated with full hadronisation. In \cos9nr\, the hadronisation corrections [23] would 
change the MC shapes towards a better agreement. cosa34 is captured slightly better for 
cos 034 ^ —1, when including additional jets. The trend to overshoot at cos 034 > 0.5 
can again be explained by the fact that we have generated the distributions at the hadron 
level, whereas the data was corrected to the parton level. We have checked by excluding 
hadronisation that these statements are true, and that the irregularities are reduced. How- 
ever, the general trends in both \cos6nr\ and cos 034 remain, albeit less pronounced. This 
might be explained with the fact that the hadronisation corrections applied to the data are 
estimated with a model different from the one used by PYTHIAS. Since the cross-over from 
partonic to hadronic states is a highly model-dependent statements, artifacts of the model 



used to estimate corrections could be present in the data. Even so, we think Figure |10 
illustrates that when including higher-order tree-level matrix elements in the description of 
e^e~ — )• jets, changes as compared to the default shower are fairly modest, which indicates 
that Pythias already nicely describes observables at LEP. When checking further LEP ob- 
servables, we find that Ckkw-l does as good or moderately better than default PYTHIAS. 
This means that when developing a new tune including additional matrix elements, the 
hadronisation parameters, which are predominantly constrained at LEP, may not have to 
be touched. 

5.3 Vector boson production 

In hadron collisions, we can assess the extent of change when including additional jets by 



looking at vector boson production with two or more additional jets. In Figures |11| and |12 
we compare jet k± spectra and jet multiplicities for W production and in Drell-Yan events 
to data, respectively. In general, we find more jets with high k± and better agreement with 
jet multiplicity data. 

It is particularly instructive to investigate the change of k± distributions when in- 



creasing the numbers of jets in the matrix element generation. Figure 13 again shows 
that the k± spectra develop harder tails when including higher multiplicity matrix element 
configurations. 

Analysing the k±2 separation when two jets are clustered into a single jet in the right 
panel, it is interesting to see how this increase arises. For small merging scales (e.g. 
10, 15 GeV), k±2 in two-jet merging quickly grows at the merging scale and remains flat 
until a more gradual ascend sets in at k±2 ~ 60 GeV. There, the ME2PS distributions 
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Figure 11: Jet multiplicity and transverse momentum of the hardest jet in W + jet events as 
measured in the electron channel by ATLAS [25]. The merging scale is tMS — inin{fc_Li} = 15 GeV. 
Effects of multiple scatterings and hadronisation are included. The plots were produced with 
Rivet [24]. 
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Figure 12: Jet multiplicity and inclusive jet transverse momentum in Drell-Yan events, as mea- 
sured by CDF [26]. The merging scale is ^ms = 30 GeV. Effects of multiple scatterings and 
hadronisation are included, and Tune 4C was chosen. The plots were produced with RIVET [24]. 

for = 10,15 GeV also join the curves for larger merging scales (30,45 GeV). This 



behaviour of ME2PS for low tyis can more clearly be seen in the left panel of Figure |14 . 
When inspecting the ME3PS curves for ^ms = 10, 15 GeV, we again see a hardening of 
the spectrum, which is to some extent stable when going to ME4PS. Such a stabilisation 
inspires the conclusion that the k±n<N separation between the n'th and (n— l)'th hardest 
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Figure 13: k± of the hardest and second hardest jet, for different number of merged jets, in W+ 
jets events at Ecu = I960 GeV in pp coUisions. The merging scale is defined in Ims = minjfcj^i}. 
Jets were defined with the fcj_-algorithm with D = 0.4. Muhiple interactions and hadronisation 
were switched off. Plots produced with PYTHIA8 Tune 4C. The bottom in-sets show the deviation 
of the merged samples from default, first-order corrected PYTHIA8. 



jets is stable once the maximal number of merged jets is increased above n, as was found 
in [21]. 

One possible argument for this effect is that when looking at the k±2 separation at 
which a jet ai and a jet 02 are clustered into a single jet in ME3PS, the parent jets 61, 62, ^3 
which produced ai and 02 were harder than in ME2PS, thus again favouring harder jets 
ai,2, i.e. larger separations, k±2- The stabilisation could then be explained by assuming 
that the parent jets producing 61,2,3 in ME4PS will not greatly increase the hardness of 
61,2,3 because in ME4PS, most jets will be just above the merging scale due to a steeply 
falling /c_L4 spectrum. 

However, in our implementation, the question arises if a stable k±2 distribution will 
also be stable to changing the value of the merging scale. First, notice that there is no shape 
change in the = 45 GeV curves when going from ME2PS to ME3PS (or ME4PS), even 
though by the above reasoning, further distortions should be more pronounced at high 
merging scales. It is critical to notice (see Figure 14) that for low merging scales, the 
spectrum in ME4PS is significantly harder than the ME4PS reference at = 30 GeV, 
whereas once their initial ascend is over, the curves for ^ms = 45 GeV nicely join the ^ms = 30 
GeV ones. These observations can again be explained by unitarity violation for Ims = 10 
GeV and t^g = 15 GeV, which stabilise when merging more jets, but do not decrease. 
Since the changed cross section is stable while the sample composition changes between 
ME2PS and ME3PS, the shape of k±2 has to change. In support of this rationale, the right 
panel of Figure 14 shows that when reducing unitarity violations by not enforcing rapidity 
ordering in the shower, the effects are significantly reduced as well. These considerations 
can be applied to jet separations k±n>2 as well. 
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Figure 14: k±2 separation of the second jet in W+ jets events at Eqm = I960 GeV in pp collisions. 
The curves are normalised to the k±2 distribution in ME4PS at ^ms — 30 GeV. Jets were defined 
with the fcj^-algorithm with D — 0.4. Multiple interactions and hadronisation were switched off. 
Left panel: Plots produced with PYTHIA8 Tune 4C. Right panel: Plots produced with PYTHIA8 
Tune 4C, with enforced rapidity ordering switched off. 

Every parton shower relies on phenomenological models to confine partons into 
hadrons, thus making systematic tuning is a critical step in the development of an event 
generator. Tuning however should not hide the shortcomings due to approximations made. 
If residual tuning effects because of correlations between tuning parameters remain in phase 
space regions with well-separated jets, we expect such changes to be stabilised when cor- 
recting with higher multiplicity matrix elements. The impact of changing between different 
tunes in PYTHIA8 is shown in Figure where we show the results of using Tune 2C, Tune 
4C and forcing as(Mz) = 0.129783 (the CTEQ6L1 fit value) in all components of PYTHIAS, 
in comparison with ATLAS data [25]. We find that the pp — ?■ W+jets predictions are fairly 
stable with respect to changing tunes. As expected, we observe that the ME3PS sample is 
harder than default PYTHIAS. 

Finally in Figure |l^ we show the effect of our treatment of multiple interactions. The 
associated hadronic activity in Z production events, especially in the azimuthal direction 
direction of the Z, is very sensitive to underlying event effects, and hence also to multiple 
interactions [27]. In our merging scheme we have been very careful to make sure that 
multiple interactions are treated exactly the same way as in standard PYTHIAS without 
inclusion of matrix element configurations. And, as seen in Figure the differences 
between the merged sample and default PYTHIAS are indeed very small. 

5.4 Di-boson and QCD jet production 

Our implementation is in principle general enough to be applied to any process that can 
be handled by PYTHIAS. However, in this publication, we restrict ourselves to two further 
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Figure 15: Transverse momentum of the second hardest jet in W + jet events as measured 
in the electron channel by ATLAS [25], for different tunes. "2C" indicates that Tune 2C was 
used, while "4C" uses Tune 4C, the current default tune in PYTHIA8.157. The label "a^ = 0.13" 
stands for fixing as(Mz) = 0.129783 in Tune 4C, as discussed in the text. The merging scale is 
iivis = niin{fc^i} = 30 GeV. Effects of multiple interactions and hadronisation are included. The 
plot was produced with RIVET [24] . 



examples. First, let us examine di-boson production, with one of the bosons decaying 
hadronically. Allowing hadronic decays of weak bosons in the underlying Born process 
provides another complication, and for technical reasons we here restrict the matrix ele- 
ment to only produce extra jets in from the incoming partons, while additional jets in the 
hadronic boson decay are only produced by the shower. As the first emission in the boson 
decay is anyway ME-corrected in standard PYTHIA8, this is not a severe restriction. Note 
however that this means that we have to treat emissions from the boson decay on the same 
footing as multiple interactions (and QED radiation). This means that they are included 
in the Sudakov form factors generated from reclustered states, but when showering from a 
n < N state, if the first emission is from the boson decay, the event is never vetoed. The 
partons from the boson decay are also not involved in the reclustering of matrix element 
states. 

The performance of our implementation concerning these issues can be tested when 



merging pp — t- W^Z — t- e^v^ jj+jets matrix elements. The left panel of Figure 17 shows 
that also in the case of di-boson production, the k±3 spectrum becomes harder on inclu- 
sion of additional jets. There are no visible differences in the default PYTHIA8 results 
when changing between only ordering emissions in evolution p and ordering both in p and 
rapidity, since k^^ is dominated by the hardest shower emission, which is not affected 
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Figure 16: Toward region charged particle density and average pj_ in Drell-Yan events, as measured 
by CDF [27]. The merging scale is ^ms = inin{fc_Li} = 30 GeV. Effects of multiple scatterings and 
hadronisation are included, and Tune 4C was chosen. The plots were produced with RIVET [24]. 



by the additional rapidity ordering. We observe only small differences between merged 
samples with and without enforced rapidity ordering in the shower. Relative changes in 
/c_i_3 are, as expected, comparable to the effects on k^i when including additional jets in 
pp —7- —7- e^i^e (see e.g. Figure 13). We have checked that different jet definitions do 
not change this trend. 

A consequence of harder jets can be seen in the right panel of Figure 0, where we 
show the di-jet invariant mass distribution with cuts and jet definition from CDF [28]. 
The spectrum develops a harder tail compared to default PYTHIA8. Particularly in the 
region 140 < rrijj < 200 GeV we find an increase around 10%. Also, the distribution is 
sensitive to the unitarity violations due to enforced rapidity ordering, so that care has to 
taken when comparing MEPS distributions to experimental data. In [28], the shape of 
the di-boson backgrounds was modelled by PYTHIA6.216, which should behave similar to 
default PYTHIA8. Merging additional jets in pp — )• W+Z — )• e+^e jj can affect the shape of 
the di-jet invariant mass spectrum in a way which will reduce the significance of the effect 
found by CDF. We plan to further investigate these issues in a future publication. 

Finally, we examine QCD jet production. For such events, we set the shower start- 
ing scale for the 2 — )• 2 process to the transverse momentum of the outgoing partons. 
The maximal scale for secondary scatterings is set to the same value. In PYTHIA8, users 
are generally allowed to choose a different prescription of setting a maximal scale of mul- 
tiple interactions, e.g. the energy of the colliding hadrons. Adopting this example, we 
risk double-counting configurations, since interactions identical to the hard process can be 
generated. 

To remove this double counting, an additional veto on the transverse momentum of 
multiple interactions in the trial shower has to be applied. We have checked that when 
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Figure 17: Sample results of including matrix elements with additional jets in pp — > W+Z — >■ 
e'^Vc jj events, at Ecm = I960 GeV in pp collisions. The merging scale is defined in Ims = minjfcj^i}. 
Multiple interactions are included. Curves with enforced rapidity ordering in the shower carry an 
additional label "y-ordered". The bottom in-set shows the deviation of the merged samples to 
Pythias. Left panel: fc^3 of the third and second hardest jet at hadron level. Jets were defined 
with the /cj^-algorithm with D = 0.4. Right panel: Di-jet invariant mass in at hadron level. Cuts are 
taken from the recent CDF publication [28] . Jets were defined with the CDF JETCLU algorithm [29] 
as implemented in fastjet. 



allowing secondary scatterings up to the kinematical limit and applying a veto, distributions 
are not changed with respect to setting pMi,max = P±,2^2- The results presented here 
have been produced with fixing the starting scales for the hard process to the transverse 
momentum, as is the default in PYTHIA8. 

Figure 18 shows that for QCD jets as well, inclusion of additional jets increases the 
hardness /c_l3 of the third jet. Compared to the changes in k±i for W-l-jets, the effect 
is, however, moderate. This is in accord with the findings in [20], which showed good 
agreement in the p± of the softest of three partons (there called p±5), when comparing 
2 — )• 3 matrix elements to the default shower after the first emission from a 2 — )• 2 core 
process. There, the shower was slightly harder than the matrix element until p±^ ~ 80 
GeV. A similar effect can be seen in the /c_l3 separation of jets, which is related to the p±^ 
of partons. 

The inclusion of a sample with two additional jets does not change the situation dra- 
matically, leading us to the conclusion that once the first few hard jets are generated 
according to the tree-level matrix elements, the parton shower does a fairly good job in 
describing the hardness of additional jets. This is supported by the upper panel of Fig- 
ure [l^, showing the /c_l3 separation between the third and second hardest jets in W-l-jets 
events. Clearly, there are only little changes in the hardness of the third jet when going 
from ME2PS to ME4PS, i.e. the merging has less impact once a couple of jets are included 
from the matrix element states^. 



^As can be seen in the lower panel of Figure |l^, this statement does not hold if there are major unitarity 
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Figure 18: k±3 separation of tlie tliird jet in pure QCD jet events at Ecu — I960 GeV in pp 
collisions. The merging scale is t-Ms — 30 GeV. Jets were defined with the fc^-algorithm with 
D = 0.4. Multiple interactions are included and hadronisation was switched off. Curves with 
enforced rapidity ordering in the shower carry the label "y-ordered" , while results without explicit 
rapidity ordering are labelled "y-unordered" . The bottom in-set shows the deviation of the merged 
sample default PYTHIA8. 



Coming back to pure QCD, we show in Figure ^ that also in this case, results for 
k±s are fairly stable when changing the merging scale. We register only small unitarity 
violations of 0(10%), which matches the changes in k±2, in W-|-jets events when going from 
ME2PS to ME4PS without requiring rapidity ordering, as illustrated in the upper part of 
Figure As W -|- n jets contains colour configurations similar to di-jet-|-(n — 2) jets, this 



is another indication for the consistency of the implementation. However, in Figure 20, we 



find only minor changes between different treatments of rapidity ordering for di-jet events, 



whereas for W-l-jets events, we observe dramatic effects (see lower plot in Figure 19). This 
can be explained by the fact that when requiring rapidity ordering, PYTHIA8 orders all 
emissions after the first shower emission in rapidity, meaning that for di-jet events, k±s is 
virtually unaffected by the constraint, while in W-l-jets events, major restrictions on the 
phase space of the second and third jet lead to large unitarity violations. This argument 
is substantiated by Figure which shows that once rapidity ordering becomes relevant, 
the additional ordering results in larger deviations for low merging scales. 

It is worth noting that since jet spectra are not changed dramatically when including 
additional jets, only small differences are expected when comparing to experimental data. 



violations - which should be avoided anyway. 
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Figure 19: k±3 separation of the second and third jet in W+jets events at Ecu = I960 GeV in 
pp colUsions. The curve shows the deviation in fc_L3 of ME4PS for three different merging scales, 
with respect to ME2PS for ^ms = 30 GeV. Jets were defined with the fc_L-algorithni with D — 0.4. 
Hadronisation and multiple interactions were switched off. Curves with enforced rapidity ordering in 
the shower carry the label "y-ordered" , while results without explicit rapidity ordering are labelled 
"y-unordered" . 



In Figure we examine the description of CDF jet shapes [30] for two exemplary p'^^ 
bins. For a in 55 GeV < < 63 GeV, we find only very minor changes. However, 
for higher p^^ in the region 128 GeV < p^l < 148 GeV the differences between default 
PYTHIA8 and the merged sample ME2PS with two additional jets are more pronounced, 
and we see that the latter gives a slightly broader shape. This is expected as at high 



transverse momentum the effect of the harder third jet in Figure 18 should come into play, 
resulting in more jets containing two partons from the matrix element. Such jets are of 
course broader. 

When checking differential jet shapes for other bins, we find that ME2PS does as 
good or slightly better than PYTHIA8 for p^^ < 120 GeV, while decreasing too slowly for 
p^^ > 120 GeV. This indicates that at least some revisions need to be made when tuning 
matrix-element-merged PYTHIA8 to pure QCD jet data. Since the influence of multiple 
interactions on jets with p^^ > 120 GeV is likely to be small, a possible new tune would 
potentially feature changes in as (Mz) and other parameters to prescribe the physics of 
hard jets. 
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Figure 20: fcj^3 separation of tlie tliird jet in pure QCD jet events at Eqm = 1960 GeV in pp 
collisions, for three different merging scales. Jets were defined with the fc^-algorithm with D = 0.4. 
Multiple interactions and hadronisation were switched off. Curves with enforced rapidity ordering in 
the shower carry the label "y-ordered" , while results without explicit rapidity ordering are labelled 
"y-unordered" . Left panel: fcj^3 separation of the third jet. Upper right panel: Deviation in k±3 
between default PYTHIA8 (Tune 4C) and PYTHIA8 (Tune 4C) with no enforced rapidity ordering. 
Centre right panel: Deviation in fcj_3 between ME2PS sample and PYTHIA8 (Tune 4C) with no 
enforced rapidity ordering. Lower right panel: Deviation in k±3 between ME2PS sample and 
defauh Pythias (Tune 4C). 

6. Conclusions and Outlook 

We have implemented Ckkw-l merging inside the PYTHIA8 framework, and have shown 
that it works well for several sample processes: e^e~ — )■ jets, (di-) boson and pure QCD 
jet production in hadronic collisions. The implementation is, however, quite general and 
could be used for any process which PYTHIA8 is able to handle. 

The algorithm is true to the Ckkw-l spirit, in that if matrix element samples are 
provided for up to extra partons, every event where the n < N hardest (in the parton 
shower sense) partons can be produced by the matrix element, it will be evolved from the 
corresponding matrix element state. 

By construction, the dependence on the merging scale vanishes to the logarithmic pre- 
cision of the Pythias parton shower. Nevertheless, we find visible sub- leading effects due 
to different choices that can be made in the procedure. In particular we have investigated 

• different ways of choosing parton shower histories, 

• different strategies for handling unordered histories, 

• different starting scales for incomplete histories, 

• different options for including multiple scatterings. 

In all these cases we found the effects to be small. 
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Figure 21: k±5 separation between the fourth and fifth jet in di-jet events at Eqm — I960 GeV 
in pp collisions. The curves show the deviation of A;_l 5 -distributions in ME2PS for three different 
merging scales, with respect to PYTHIA8. Jets were defined with the fcj^- algorithm with D = 0.4. 
Hadronisation and multiple interactions were switched off. Curves with enforced rapidity ordering in 
the shower carry the label "y-ordered" , while results without explicit rapidity ordering are labelled 
"y-unordered" . 

However, we found that in some cases there are large merging scale dependences from 
unitarity violations. These problems have been noted before in other CKKW-based algo- 
rithms [18,19], and arise from the fact that what is exponentiated in the Sudakov form 
factors is only the parton shower approximation to the matrix elements, rather than the 
matrix elements themselves. In addition, the phase space integrated over in the Sudakov 
may differ from the full phase space available to the matrix element. 

For our implementation, one would expect the unitarity violations to be diminished 
in the cases where PYTHIA8 already include a matrix-element reweighting of the first 
parton shower emission (similar to what is done in POWHEG [12,31]). However, we found 
that the effects on the contrary are very large, and traced the reason for this to the fact 
that the default tune of PYTHIAS uses a rapidity ordering for the initial-state shower 
in addition to the ordering in the transverse momentum evolution scale. This results in a 
severe restriction of the phase space over which Sudakov form factors are integrated, giving 
increased merging scale dependences. When removing the rapidity ordering, the unitarity 
violations are reduced to an almost negligible effect. 

An important result of our investigations is that the PYTHIAS shower (without enforced 
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Figure 22: Jet shapes in QCD events as measured by CDF [30]. The merging scale is tMS = 
min{fc_Li} = 30 GeV. Effects of multiple scatterings and hadronisation are included. The plots were 
produced with RIVET [24] . 

rapidity ordering) actually is quite good at describing the hardness of multi-jet events, as 
long as the hardest few jets are generated according to the exact matrix elements. Of 
course, there may be special observables related to details in the correlations between jets, 
where merging with high-multiplicity matrix elements is still necessary to get a correct 
description, but for the main features of multi-jet event it seems to be enough to merge 
with a limited number of extra jets. 

Before our Ckkw-l implementation can be used for reliable predictions and compar- 
isons with experimental data, the parameters of PYTHIA8 need to be retuned. We have 
shown that for e~''e~ — )• jets, the merging with multi-jet matrix elements barely changes 
the description of data, and we can assume that the parameters for the hadronisation and 
final state showers will not need to be substantially changed. Furthermore, for pure QCD 
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processes in hadronic collisions, the effects of multi-jet merging are again very modest, 
except for very high transverse momentum jets, so also for minimum bias and underlying 
event observables the tuning needed can be assumed to be minor. On the other hand, for 
electro-weak processes and for very hard jets in pure QCD processes in hadronic collisions 
the merging gives quite substantial effects, which means retuning is necessary. To get stable 
results, this new tune should be done without the rapidity ordering discussed above. 
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A. Comments on the logarithmic accuracy of Ckkw-l 




Figure 23: A schematic view of how the two-parton phase space is filled in Ckkw-l. The figure 
illustrates how two partons at evolution scales pi and p2 can be classified in terms of a merging 
scale tjvis defined in the variable t. The vertical axis is the shower ordering scale (which is different 
from the t-scale) and the horizontal axis is an auxiliary splitting variable. The different states are 
evolved from different matrix-element samples: (a) starts from the 2-jet ME, (b) the 0-jet ME, (c) 
the 1-jet ME, and finally (d) is evolved starting from the 0-jet ME. 



Here, we would like to elaborate on how the logarithmic accuracy of the shower is 
preserved in Ckkw-l. This discussion is independent of the functional form of the merging 
scale variable. For the sake of illustration, let us analyse how the phase space for two 
additional partons is filled by the merging. As shown in Figure there are four different 
ways in which two shower emissions at scales pi and p2 can be classified in the variable 
t. Let the merging scale ty^s separate the regions of low and high t, as sketched in the 
Figure H 

In panel (a), the two partons are inside the matrix element region and as such should 
be generated by the matrix element generator. Sudakov form factors are added in the 
way discussed in 4^. Notice that this is done by performing trial showers, i.e. that we 
discard the event if pi > ptriai > P2, and we never change reclustered states or the matrix 
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element configuration. In this way, for non-zero weighted events, every ME configuration 
is treated exactly as in the parton shower. It is also clear that panel (d), a configuration 
with two jets which are soft in t should come from the parton shower. Panel (c) provides 
the next complicated region. Since region (a) is already correctly accounted for, we veto 
shower emissions that would evolve the state into one in (a). This means that all states 
which evolved from a matrix element state with one jet above tMs, and which have non- 
zero weight, i.e. with the first emission below ^ms or no emission at all, are produced by 
PS evolution. As outlined in section L3, Sudakov form factors below the reconstructed 
evolution scale of the ME emission are added by using the full shower when producing 
trial emissions. We ensure in this way that form factors are added to the ME one-jet 
configuration in the correct parton shower manner. 

To our understanding, the truncated shower approach and Ckkw-l only differ in their 
treatment of panel (b). Let us clarify this statement. In Ckkw-l, we define the ME region 
to contain the n < N hardest jets in the evolution variable p, which are also above the cut 
in t. Once we are inside the PS region, we believe the shower is performing well, so that 
all further emissions will be taken from the parton shower. The first emission in panel (b) 
is already in the PS region. Thus, every further splitting is taken from the shower. For the 
example this means that this two jet state is generated from the 0-jet ME sample. Since 
the shower is the only ingredient in how the state is produced, the accuracy of the shower 
is preserved. 

Truncated showers differ, in that they allow what we call a pure PS state to evolve 
into something which could have evolved from a ME 1-jet state. Here, the emission is kept, 
the reclustered state changed, and evolved further until the scale p2 is reached. Then, an 
emission with the reconstructed ME splitting variables is forced. In this way, the path how 
the state was reached is correctly described, and the accuracy of the shower is retained. 
Truncated showering is allowed if the emissions that were inserted before the hard emission 
are soft and did not change the flavour of the line that will emit the hard jet. 

This example reveals the different philosophies behind the Ckkw-l and Truncated 
Shower approaches. In Ckkw-l, a compromise is made in that only the n < N emissions 
hardest in the evolution variable, and above the merging scale cut ^ms; are corrected with 
matrix element configurations. Thus, in comparison to using truncated showers, a smaller 
region of phase space have a matrix element structure. However, we are allowed to use the 
full shower to generate no-emission probabilities. 

When using truncated showers, the flavour of the splitting lines has to be conserved 
in order to be able to attach the ME emission, i.e. truncated showers only allow gluon 
emissions. Also, splittings in the truncated showers cannot be allowed to remove too much 
momentum from the line, since otherwise, the ME emission could not be forced. These 
restrictions make the Sudakov form factors differ slightly between the full shower and 
the truncated shower, though differences are sub-leading and might be tiny in an actual 
implementation. 

Summarising, we believe that both Ckkw-l and the Truncated Shower approach have 
to compromise in regions with i^s-unordered splittings. In Ckkw-l, only the hardest 
partons in the evolution variable will be corrected with tree-level matrix elements, as long as 
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they are above Ims a-s well. This effectively means that the shower evolution variable should 
be some measure of hardness, since otherwise, only small regions of the relevant phase space 
will be endowed with corrections. Choosing e.g. a shower with an ordering variable defined 
by angles would not be suitable. Truncated Shower prescriptions allow correcting larger 
parts of the phase space with ME configurations, though at the expense of compromising in 
the generation of Sudakov form factors. This approach is particularly suited if the evolution 
variable does not provide a hardness measure, since then, the differences in the Sudakov 
form factors are vanishing, while large fractions of the phase space can be described by ME 
emissions. Since the evolution in transverse momentum provides a good hardness measure, 
Ckkw-l provides a natural merging scheme in PYTHIA8. 

B. Reconstructing shower splitting probabilities and intermediate states 

In a numerical fixed order calculation, different Feynman graphs can contribute to a par- 
ticular phase space point. The analogue in a parton shower is that a multitude of different 
sequences of shower splittings can fill the same phase space point. We describe in this 
appendix the construction and choice of parton shower histories. The prescriptions below 
are implemented in PYTHIA8, with the code being publicly available from version 8.157 
onwards. Given a matrix element state -S+n, all possible intermediate states, splitting prob- 
abilities and splitting scales are reconstructed. We first detail how splitting probabilities 
are calculated and used to choose a particular path of shower splittings. We will after this 
outline how intermediate states are constructed. 

B.l Calculation of splitting probabilities 

When assigning a parton shower history to a matrix element state, we have to decide on how 
to choose amongst all possible splitting sequences. Our choice of a suitable discriminant 
between these "paths" is guided by the coUinear factorisation of n-particle matrix elements: 

dan = J~-n{Xn,tn)Fn \Mnf' d^ji ~ 

«s 1 X ^ / X „ , . . ,9 dk"^, dz d(b ,^ , s 

^-^P{z)Cn{Xn,tn)Fn\Mn-lf ^^^^^ ^ 

where is the flux factor and C^n the parton luminosity for the m-parton final state using 
the factorisation scale t„, while (fc^, z, 0) are the splitting variables, the virtuality of the 
splitting parton, and P{z) is the DGLAP splitting kernel for the splitting. The integration 
measure is given by 

d^m = dcl>m^^, (B.2) 
Xm Xm 

where d4>m is the m-particle phase space volume and are the momentum fractions of 
the incoming partons moving in itz direction. Using the fact that 

^niXnitn) — X^fn {Xn ''^'n)'^n fn (.•^n'^n) ; with — £-n—li,Xn—lTtn—l) for FSR,, 

and (B.3) 
^ ^ iFn-i for FSR 
" 1 zFn-i for ISR 
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as well as the definition of the evolution variable in eqs. (^^) and (|3.5| ), we can write the 
factorised transition cross section as 



dan 



2^mdkldz^] dan-l, 



n-l 



for FSR; 
for ISR. 



(B.4) 



To illustrate initial state radiation, we have here chosen the parton moving along +z 
direction to split. Iterating this procedure down to the desired Born-level state represented 
by m = 0, we can construct one path of collinear splittings by which we may have arrived 
at the n-particle state. We can use the sum over all different possible paths p, 



dan 



n 



Xipfip{xip, Pip) Pip{Zip) 2 J d(f)ip 

— aK^^pdZip 

Pip ZTT 



EH- 

p '^'^ Xi — \pfi—\p{Xi—\p, Pip) 



dao 



(B.5) 



where {pf^, Zip, (pip) and Pip are the reconstructed splitting variables and splitting function 
for the i'th splitting in the path p, as an approximation of the n-parton cross section. The 
PDF ratio will equal unity for final state splitting. We can make this correspondence exact 
for the very first splitting, by adding matrix element corrections to the splitting kernel and 
finding a common integration measure for the joined evolution along the possible paths, 
as will be addressed in the following. The very first emission can be attributed to either a 
splitting of dipole end "1", or of dipole end "2". If the momentum of dipole end i G {1,2} 
after the splitting is pi, and the momentum of the emitted parton is ps, we define 



Qlq^p 




for FSR 
for ISR 



.2 
Zli 



''T^Dip 



with Xk 



Xi+X-j, 1 
(Pl+P2-P3)^ 
~~{pi+P2p 



(P1+P2+P3F 



{pi+P2+Pzf for FSR 



for FSR 
for ISR 



With this notation, a joined evolution equation is given by 



dV^ 



p=i 



as Pip{zip)Pf 



pMEcorr 



ip 



E 



Xlpflpi^Xlp, Pip) 



■[ 27r xop/op(xop,pip) 



-Plp(^lp)-f; 



1 - zip)m%^p 

Q\l + (5x2 



dp]_dy 



pMEcorr " 



(1 ^lp)Qlg^p + Preg 
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dQ'^dz . (B.7) 



(1 - Zip)'^Ql^Ql2 + (1 - Zip)'^PregS + pleg 

The common integration measures for both paths were defined by introducing the variables 



pI 



QliQ 



2 

12 



1 Q 

y = - In — for the FSR case, and 

2 (^ii 

dQ^ = d|(5ii,i2| z = zii = z\2 for the ISR case. 



Dip 



(B.^ 
(B.c 
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For initial state splittings, the weight takes a more complicated form since in PYTHIA8, 
infrared singularities are regularised by the introduction of a small scale preg- This is 
inspired by the regularisation of multiple interactions using arguments relating to colour 
screening effects [20]. For vanishing preg^ the weight for the first splitting of initial particles 
again takes the form given in eq. ( [B.4D . Note that we keep fixed, as the running of 
Os is corrected for later in the algorithm. Also, the change in incoming parton content, 
compensated by ratios of parton distributions, will be corrected later on. Hence, the weight 
for each splitting should not contain or PDF ratio factors. The product of these weights 
of individual splittings in a path will then be used as the weight when choosing a path with 
the normalised probability 



where (B.IO) 



Wlp{zip) 



i=2 

-Plp(zip)-PpMEcorr (-'-"^IP )"^l5ip 

1 

Ip 



Qii+'9i2 



D (^^ \p (1 2lp)Qi^^p+Preg fnrTG!R 



for FSR 

(B.ll) 



In section |5|, we compare this probabilistic prescription with a winner-takes-it-all strategy 
based on the smallest sum of transverse momenta, and observe minor, though visible, 
differences. 



B.2 Reconstruction of intermediate states 

Given an n-parton phase space point 5+„ from a matrix element generator, we explicitly 
construct all possible intermediate states S+q . . . , S+n-i in all paths p by reclustering al- 
lowed shower emissions. For the construction of the state S+j, given that we have the state 
5+4+1, this rather complicated step is achieved by inverting all the changes the shower 
would have applied in the construction of the emission. This means that we need construct 

1. The underlying momenta p = {po, . . . ,Pk+i} from the momenta p = {po, . . . ,pk+i+i}; 

2. The underlying flavour configuration J^+j from the configuration J^+j+i; 

3. The underlying colour configuration C+j from C+j+i. 

In the following, write "before" for values before the clustering, and "after" for values after 
the clustering. 

Reclustering of momenta 

The construction of the reclustered momenta p from the momenta p is achieved by exactly 
reverting all changes PYTHIA8 would have done if the showers would have constructed 
an emission resulting in the momenta p. Formally speaking, this means that we invert 
the radiative phase space mapping of the shower. The construction of the momentum of 
an emission in PYTHIA8 differs between initial state and final state splittings, leading to 
different prescriptions how underlying kinematics p are constructed. 
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For a final state emission witli a final state recoiler, this means that the momenta of 
the reconstructed radiator and recoiler in the rest frame of the dipole are set to 
u _ / rriDip mDvp\ „ - ( o o _!!^ 

^radiator, after — ^i 2 ' 2 / ' -f^recoiler, after — 2 ' 2 / 

and then rotated and boosted from the rest frame of the decaying dipole^ to the event 
centre-of-mass frame^. 

For final state splittings with an initial state recoiler, the shower would have taken the 
energy (four-momentum) for the emitted particle from the beam. This steps is undone after 
the boost to the event centre-of-mass frame by setting the recoiler momentum according 
to 

/i 2 M i") after Lorentz transformation 

P recoiler, after P recoiler, before P recoiler, after 

For initial state splittings, PYTHIA8 distributes the recoil among all final state particles, 
making the inversion of this momentum mapping more complicated. We will denote all 
unchanged momenta of the original 2 — t- n process by pi. The momenta of the partons 
involved in the splitting are denoted by Pmother; Psister and ^partner- When referring to 
Pmother) Psister or ^partner in the foUowiug we always think about the momenta of these 
particles at the current step in the construction of reclustered kinematics. Inverting the 
construction of kinematics of PYTHIA8 proceeds as follows: 

1. Undo the rotation with 

/ j_ 1 [Psisterjj/ 

(f) = arctan I p 

\ LPsister J x , 

that Pythias would have done, by rotating all momenta with —cj). 

2. Transform all momenta from the event centre-of-mass frame^ to the centre-of-mass 

frame^ of the momenta = pj^^^ghtcr = ^mother " P^istcr and pj'eeoilcr = ^partner- Notice 
that we transform to the centre-of-mass frame of the off-shell momentum J^daughter • 

3. Undo the 



—0 = — arctan 



•^[Pmotherjx ~^ [Pmother]^ 



mother I z 



rotation that PYTHIAS would have done by rotating all momenta with 6. 
4. Construct the on-shell momenta ^daughter and Precoiier by resetting 

f'daughter = ( 0, 0, ^S, ^s) p^^^^jj^^ = (^0, ^S^ , 



^Defined by pradiator, before + Pemitted, before aligned along -|-«-direction, precoiier, before aligned along -z- 
direction 

^Defined by the orientation of the momenta Pradiator, before + Pemitted, before and Precoiier, before taken from 
the unchanged 2 — >■ n state, where these are not anti-parallel. 

^Defined by the orientation of the momenta pdaughter andprecoiier in the rotated, but otherwise unchanged 
2 — >■ n process. 

^Defined by pkaughter being aligned to the -l-a-direction and precoiier aligned to -a-direction. 
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where 

s-zx.x,.E^ and - ^ xo - ^ K+P^-P^f 
. - ^^1^2 ^CM and X, - , x,-^^^, z- ^ . 

5. Boost along the z-axis to the frame where the energy fraction of the newly constructed 
^'recoiier Original valuc X2, i.e. along the vector 

( Z-XI-X2 

Pboost = 0, 



Z- X1+X2, 

After this boost, the newly constructed P^ecoiier should be identical to 
6. Undo the initial —(j) rotation by a rotation with ^. 

These changes allow us to reconstruct the state from which PYTHIA8 would have con- 
structed the matrix clement momenta if the shower would have produced a splitting at the 
reconstructed splitting scale. We tested that this method of inverting the shower splitting 
kinematics exactly reproduces lower multiplicity states from states with additional shower 
emissions. We found complete agreement, since this construction explicitly inverts the 
momentum mapping of the shower. 

Reconstruction of the underlying flavour structure 

To reconstruct the intermediate state S'+j, we further have to assign the correct flavour 
structure T+i. With the notation f{k) and f{k) for the flavour of particle k and the 
antiparticle to k, the flavour mapping can be accomplished by following the rules: 

1. If the emitted parton is a gluon, then 

/(radiator, after) = /(radiator, before) 

2. If the emitted parton is a quark, and the radiating parton is a gluon, then 

/(radiator, after) 



(/(emitted, before) in FSR, 
/(emitted, before) in ISR, 



3. If the emitted parton is a quark, and the radiating parton is the corresponding anti- 
quark, then 



/(radiator, after) 



g in FSR, 

not possible in ISR, 



4. If the emitted parton is a quark, and the radiating parton is a quark of the same 
flavour, then 



/(radiator, after) 



not possible in FSR, 
g in ISR, 



This exhausts the list of QCD flavour mappings in PYTHIA8, so that following these rules, 
the flavour configuration F+i of the state 5+i can be reconstructed. 
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Reconstruction of the underlying colour structure 

Finally, we need to construct the colour configuration C+j. Let us write c (c) for the colour 
(anticolour) of partons, and indicate the parton to be reconstructed by a subscript r. After 
flavours have been assigned, the colour of the parton p^ can be found by following the rules 

1. For final state splittings with a gluon involved as either emitted or radiating parton, 
i.e. 

<ir qgemt , Qr ^ gQemt , Qr ^ Qgemt , Qr ^ gQemt , gr ^ ggemt (FSR) 

remove the index appearing both as colour and anticolour in the (emitted, radiating)- 
parton pair. Set the leftover colour and anticolour as the colour and anticolour of p^, 
i.e. 

''qr ~ ' ^qr ~ ^C/emt 

^Sr = (^gemt ) ^g^ = Cg^^^ Or Cg^ = Cg^^^ , Cg^ = Cg^^j 

The second possibility for g,. — )• ggemt can occur if the matrix element generator 
produced a non-planar colour flow. 

2. For final state splittings with quark and antiquarks as emitted and radiating partons, 
i.e. 

gr qqemt , gr ^ QQemt (FSR) 

set the colour of g^ to the quark colour, the anticolour to the antiquark anticolour. 
This means 

Cg^ = Cq , Cg^ = Cq 

irrespectively of whether the quark or the antiquark is considered the emitted parton. 

3. For initial state splittings with an emitted gluon, i.e. 

g grgemt , Q ^ Qrgemt (ISR) 

remove the index appearing as colour (or anticolour) both in the emitted and radiating 
parton. If a colour (anticolour) index remains in the initial state, set the colour 

(anticolour) of p^. to the remaining initial state colour (anticolour), and set the p^ 
anticolour (colour) to the remaining final state colour (anticolour) index. 

4. For initial state splittings with an emitted quark (antiquark) and a gluon radiator, 
i.e. 

g Qr-qemt , g ^ Qrlemt (ISR) 

set the Pr colour (anticolour) to the colour (anticolour) of the radiating gluon. 
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5. For initial state splittings with an emitted quark (antiquark) and a quark (antiquark) 
radiator, i.e. 



q grqemt 



q grqemt 



(ISR) 



set the gr colour (anticolour) to the anticolour (colour) of the emitted parton. Set 
the reconstructed gluon anticolour (colour) to the anticolour (colour) of the radiating 
parton. 

Once a pair of radiating and emitted partons is chosen, these rules can be applied to deduce 
the colour configuration C+j of the state S+i . 

Combining the inversion of the parton shower kinematics, the construction of the 
underlying flavour configuration and reclustering of colours, the complete state S+i can 
be generated. We have extensively tested that, given a state S'+n, our implementation 
exactly reproduces all states 5'_|_(m<„), if the states S'+m+ii • • • S+n were generated by shower 
splittings, verifying that we have used the exact inversion of the radiative mappings of 
Pythias. 
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